Re-Ranking System with BERT for Biomedical Concept Normalization

Authors

Abstract

In recent years, various neural network architectures have been successfully applied to natural language processing (NLP) tasks such as named entity normalization. Named entity normalization is a fundamental task for extracting information from free text; it aims to map entity mentions in a text to gold standard entities in a given domain-specific ontology. However, normalization in the biomedical domain is still challenging because of multiple synonyms, acronyms, and numerous lexical variations. In this study, we regard biomedical entity normalization as a ranking problem and propose an approach to rank normalized concepts. We additionally employ two factors that can notably affect the performance of normalization: task-specific pre-training (Task-PT) and a calibration approach. On five different benchmark corpora, our experimental results show that the proposed model achieved significant improvements over previous methods and advanced the state of the art, with up to a 0.5% increase in accuracy and a 1.2% increase in F-score.
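To make the ranking formulation concrete, here is a minimal sketch of re-ranking candidate concepts for one mention. It is not the model from the paper: the scorer below is a simple character-trigram Jaccard similarity standing in for a learned (for example BERT-based) scorer, and the concept IDs, names, and function names are hypothetical placeholders chosen for illustration.

```python
from typing import Callable, List, Set, Tuple


def char_ngrams(text: str, n: int = 3) -> Set[str]:
    """Return the set of character n-grams of a lowercased, space-padded string."""
    padded = f" {text.lower()} "
    return {padded[i:i + n] for i in range(len(padded) - n + 1)}


def overlap_score(mention: str, concept_name: str) -> float:
    """Jaccard similarity over character trigrams.

    Assumption: this is only a stand-in for a learned relevance scorer.
    """
    a, b = char_ngrams(mention), char_ngrams(concept_name)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def rerank(
    mention: str,
    candidates: List[Tuple[str, str]],
    score: Callable[[str, str], float] = overlap_score,
) -> List[Tuple[str, str, float]]:
    """Score every (concept_id, concept_name) candidate and sort best-first."""
    scored = [(cid, name, score(mention, name)) for cid, name in candidates]
    return sorted(scored, key=lambda item: item[2], reverse=True)


# Hypothetical candidate list, e.g. produced by an earlier retrieval step.
candidates = [
    ("C1", "breast cancer"),
    ("C2", "lung cancer"),
    ("C3", "breast cyst"),
]
ranked = rerank("breast cancers", candidates)
top_concept_id = ranked[0][0]
```

The mention is normalized to whichever concept ends up first in `ranked`. Swapping `overlap_score` for a stronger scorer leaves the ranking step unchanged, which is the main appeal of treating normalization as ranking.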

Similar resources

Visual Search Optimization using Concept Related Re-Ranking

Visual search re-ranking is defined as re-ordering visual documents such as images and videos based on the initial search. Ranking multimedia content such as images and videos is a challenging research topic in a noisy visual environment. Nowadays, leading search engines depend fully on the description, title, and surrounding information of an image, which produces irrelevant images which are not equ...

BIOTEX: A system for Biomedical Terminology Extraction, Ranking, and Validation

Term extraction is an essential task in domain knowledge acquisition. Although hundreds of terminologies and ontologies exist in the biomedical domain, the language evolves faster than our ability to formalize and catalog it. We may be interested in the terms and words explicitly used in our corpus in order to index or mine this corpus or just to enrich currently available terminologies and ont...

KISTI at TREC 2014 Clinical Decision Support Track: Concept-based Document Re-ranking to Biomedical Information Retrieval

With fast development of medical information systems and software, clinical decision support (CDS) systems continue to develop new methods to deal with diverse information coming from heterogeneous sources such as a large volume of electronic medical records (EMRs), patient genomic data, existing genomic pharmaceutical databases, curated disease-specific databases, peer-reviewed research, etc. ...

Biomedical term normalization of EHRs with UMLS

This paper presents a novel prototype for biomedical term normalization of electronic health record excerpts with the Unified Medical Language System (UMLS) Metathesaurus. Despite being multilingual and cross-lingual by design, we first focus on processing clinical text in Spanish because there is no existing tool for this language and for this specific purpose. The tool is based on Apache Luce...

Ranking Biomedical Annotations with Annotator's Semantic Relevancy

Biomedical annotation is a common and effective artifact for researchers to discuss, express opinions, and share discoveries. It is becoming increasingly popular in many online research communities and carries much useful information. Ranking biomedical annotations is a critical problem for data users who need to get information efficiently. As the annotator's knowledge about the annotated entity normally determi...

Journal

Journal title: IEEE Access

Year: 2021

ISSN: 2169-3536

DOI: https://doi.org/10.1109/access.2021.3108445